# Information Retrieval Optimization

## Finetuned Cross Encoder L6 V2
Author: CharlesPing · Tags: Text Embedding · Downloads: 22 · Likes: 1

A fine-tuned cross-encoder based on cross-encoder/ms-marco-MiniLM-L6-v2, used primarily for text re-ranking and semantic search.
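
Cross-encoders like this one are typically loaded through the sentence-transformers `CrossEncoder` class, which scores each (query, passage) pair jointly. A minimal sketch, shown with the base model id from the description since the fine-tuned checkpoint's exact id is not given in the listing:

```python
from sentence_transformers import CrossEncoder

# Base model id taken from the description; swap in the fine-tuned checkpoint.
model = CrossEncoder("cross-encoder/ms-marco-MiniLM-L6-v2")

query = "how do plants make food"
passages = [
    "Photosynthesis converts sunlight, water, and CO2 into glucose.",
    "The stock market closed higher on Friday.",
]

# The cross-encoder reads query and passage together and emits a relevance score.
scores = model.predict([(query, p) for p in passages])
for passage, score in sorted(zip(passages, scores), key=lambda x: x[1], reverse=True):
    print(f"{score:.3f}  {passage}")
```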
## GTE ModernColBERT V1
Author: lightonai · License: Apache-2.0 · Tags: Text Embedding · Downloads: 157.96k · Likes: 98

A ColBERT-architecture sentence-similarity model trained with the PyLate library, using Alibaba-NLP/gte-modernbert-base as the base model and a distillation loss; suited to information retrieval tasks.
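
ColBERT-style models score relevance by late interaction: every query token embedding is compared against every document token embedding, and the per-query-token maxima are summed (MaxSim). A minimal sketch of that scoring step in PyTorch, with random tensors standing in for the model's token embeddings:

```python
import torch

def maxsim(query_tokens: torch.Tensor, doc_tokens: torch.Tensor) -> float:
    """Late-interaction (ColBERT) relevance: for each query token, take the
    best-matching document token, then sum over query tokens.
    Inputs are L2-normalized token embeddings: (n_q, dim) and (n_d, dim)."""
    sim = query_tokens @ doc_tokens.T          # cosine similarities, (n_q, n_d)
    return sim.max(dim=1).values.sum().item()  # MaxSim over doc tokens, summed

# Toy stand-ins for real model output.
q = torch.nn.functional.normalize(torch.randn(5, 128), dim=-1)
d = torch.nn.functional.normalize(torch.randn(40, 128), dim=-1)
print(maxsim(q, d))
```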
## Reranker ModernBERT Base Gooaq Bce
Author: akr2002 · License: Apache-2.0 · Tags: Text Embedding, English · Downloads: 16 · Likes: 1

A cross-encoder fine-tuned from ModernBERT-base for text re-ranking and semantic search.
## Reasoning Bert Ccnews
Author: bwang0911 · Tags: Text Embedding · Downloads: 13 · Likes: 1

A fine-tuned BERT-based sentence-transformer that maps sentences and paragraphs into a 768-dimensional vector space, supporting semantic textual similarity and semantic search.
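
Bi-encoders of this kind embed each text independently and compare the vectors with cosine similarity. A sketch with sentence-transformers; the model id below is inferred from the listing's author and name, so adjust it if the actual repository differs:

```python
from sentence_transformers import SentenceTransformer, util

# Model id assumed from the listing (author "bwang0911", name "reasoning-bert-ccnews").
model = SentenceTransformer("bwang0911/reasoning-bert-ccnews")

sentences = ["A man is eating food.", "Someone is having a meal."]
embeddings = model.encode(sentences)          # two 768-dimensional vectors
print(util.cos_sim(embeddings[0], embeddings[1]))
```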
## Reranker Bert Tiny Gooaq Bce Tanh V4
Author: cross-encoder-testing · License: Apache-2.0 · Tags: Text Embedding, English · Downloads: 1,971 · Likes: 0

A cross-encoder fine-tuned from bert-tiny that computes similarity scores for text pairs, suitable for semantic textual similarity and semantic search.
## Rank1 32b
Author: jhu-clsp · License: MIT · Tags: Large Language Model, Transformers, English · Downloads: 18 · Likes: 0

An information-retrieval reranking model based on Qwen2.5-32B that judges relevance by first generating an explicit reasoning chain.
## Rank1 14b
Author: jhu-clsp · License: MIT · Tags: Large Language Model, Transformers, English · Downloads: 23 · Likes: 0

A 14-billion-parameter reasoning reranker that improves retrieval performance by generating an explicit reasoning chain before making each relevance judgment.
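
Both rank1 variants follow a generate-then-judge pattern: the model writes out its reasoning and ends with a relevance verdict that can be mapped to a score. A rough sketch with plain transformers; the model id and, especially, the prompt template here are assumptions for illustration, and the authoritative template is on the jhu-clsp model cards:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Model id assumed from the listing; the prompt format below is NOT the
# official rank1 template, just an illustrative stand-in.
tok = AutoTokenizer.from_pretrained("jhu-clsp/rank1-14b")
model = AutoModelForCausalLM.from_pretrained("jhu-clsp/rank1-14b", device_map="auto")

prompt = (
    "Determine whether the passage is relevant to the query. "
    "Reason step by step, then answer true or false.\n"
    "Query: what causes tides\n"
    "Passage: Tides are caused by the gravitational pull of the moon and sun.\n"
)
inputs = tok(prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=256)  # leave room for the reasoning chain
print(tok.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```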
## Namaa ARA Reranker V1
Author: NAMAA-Space · License: Apache-2.0 · Tags: Text Embedding, Transformers, Arabic · Downloads: 56 · Likes: 4

A model designed specifically for Arabic reranking, evaluating the relevance between queries and passages.
## Arabic Reranker V1
Author: oddadmix · Tags: Text Embedding, Arabic · Downloads: 21 · Likes: 1

A BERT-based Arabic reranking model optimized for Arabic text relevance ranking.
## Llm2vec Meta Llama 31 8B Instruct Mntp
Author: McGill-NLP · License: MIT · Tags: Text Embedding, Transformers, English · Downloads: 386 · Likes: 2

LLM2Vec is a simple method for converting decoder-only large language models into text encoders: enable bidirectional attention, train with masked next-token prediction, and apply unsupervised contrastive learning.
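
The authors ship a companion llm2vec library for loading these checkpoints as encoders, with instruction-prefixed queries passed as [instruction, text] pairs. A sketch assuming that documented pattern; the checkpoint id is taken from the listing's name and may differ from the actual hub id:

```python
import torch
from llm2vec import LLM2Vec

# Checkpoint id assumed from the listing; assumes a CUDA device is available.
l2v = LLM2Vec.from_pretrained(
    "McGill-NLP/LLM2Vec-Meta-Llama-31-8B-Instruct-mntp",
    device_map="cuda",
    torch_dtype=torch.bfloat16,
)

# Queries carry an instruction prefix; documents are encoded as-is.
q = l2v.encode([["Given a web search query, retrieve relevant passages:",
                 "what is photosynthesis"]])
d = l2v.encode(["Photosynthesis converts light energy into chemical energy in plants."])
print(torch.nn.functional.cosine_similarity(q, d))
```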
## Thusinh1969 Gemma2 2b Rerank Checkpoint 8800 Gguf
Author: RichardErkhov · Downloads: 71 · Likes: 0

A text-ranking model based on the Gemma-2 2B architecture, offered in multiple GGUF quantizations to suit different hardware.
## Ruri Reranker Large
Author: cl-nagoya · License: Apache-2.0 · Tags: Text Embedding, Japanese · Downloads: 2,538 · Likes: 11

Ruri Reranker is a general-purpose Japanese reranker built on the Sentence Transformers architecture, designed for Japanese text relevance ranking.
## Ruri Reranker Base
Author: cl-nagoya · License: Apache-2.0 · Tags: Text Embedding, Japanese · Downloads: 1,100 · Likes: 4

A general-purpose Japanese reranker for improving relevance ranking in Japanese text retrieval.
## Ruri Reranker Stage1 Base
Author: cl-nagoya · License: Apache-2.0 · Tags: Text Embedding, Japanese · Downloads: 26 · Likes: 0

A Transformer-based Japanese text reranker designed to improve the ranking quality of retrieval results.
## Ruri Reranker Small
Author: cl-nagoya · License: Apache-2.0 · Tags: Text Embedding, Japanese · Downloads: 116 · Likes: 2

A reranker optimized for Japanese text, built on the sentence-transformers architecture, for improving the relevance ranking of search results.
## Ruri Reranker Stage1 Small
Author: cl-nagoya · License: Apache-2.0 · Tags: Text Embedding, Japanese · Downloads: 25 · Likes: 0

A general-purpose Japanese reranker designed to improve the relevance ranking of Japanese retrieval results; the small variant keeps strong performance with a smaller parameter count.
## Ko Reranker 8k
Author: upskyy · License: Apache-2.0 · Tags: Text Embedding, Transformers, Multilingual · Downloads: 14 · Likes: 11

A text-ranking model fine-tuned on Korean data from BAAI/bge-reranker-v2-m3.
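
Since this model is fine-tuned from BAAI/bge-reranker-v2-m3, the FlagEmbedding interface used for the parent model should carry over. A sketch under that assumption; the model id is inferred from the listing:

```python
from FlagEmbedding import FlagReranker

# Model id assumed from the listing (author "upskyy", name "ko-reranker-8k").
reranker = FlagReranker("upskyy/ko-reranker-8k", use_fp16=True)

# Score a Korean (query, passage) pair; higher means more relevant.
score = reranker.compute_score([
    "대한민국의 수도는 어디인가?",          # "What is the capital of South Korea?"
    "서울은 대한민국의 수도이다.",          # "Seoul is the capital of South Korea."
])
print(score)
```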
## Fingumv3
Author: FINGU-AI · Tags: Text Embedding · Downloads: 26 · Likes: 1

A sentence-transformers model fine-tuned from dunzhang/stella_en_1.5B_v5 that produces 1024-dimensional dense vectors for sentences and paragraphs, suited to semantic textual similarity and semantic search.
## Monoelectra Base
Author: webis · License: Apache-2.0 · Tags: Large Language Model · Downloads: 69 · Likes: 4

An ELECTRA-based cross-encoder for text ranking, distributed via the lightning-ir library; its passage re-ranking ability is distilled from large language models.
## Crossencoder Xlm Roberta Base Mmarcofr
Author: antoinelouis · License: MIT · Tags: Text Embedding, French · Downloads: 51 · Likes: 0

A French cross-encoder based on XLM-RoBERTa, designed for passage re-ranking in semantic search.
## Venusaur
Author: Mihaiii · License: MIT · Tags: Text Embedding · Downloads: 290 · Likes: 3

A sentence-embedding model built on the Mihaiii/Bulbasaur base model, focused on sentence similarity and feature extraction.
## Japanese Reranker Cross Encoder Small V1
Author: hotchpotch · License: MIT · Tags: Text Embedding, Japanese · Downloads: 209 · Likes: 3

A cross-encoder reranker trained on Japanese data for text ranking tasks.
## Instructor Xl
Author: retrainai · License: Apache-2.0 · Tags: Text Embedding, Transformers, English · Downloads: 22 · Likes: 0

A T5-based sentence-embedding model focused on semantic similarity and information retrieval for English text.
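
INSTRUCTOR models condition each embedding on a natural-language instruction, so one checkpoint can serve retrieval, clustering, and classification. A sketch using the InstructorEmbedding package with the upstream hkunlp/instructor-xl checkpoint (this listing's entry appears to be a mirror of it):

```python
from InstructorEmbedding import INSTRUCTOR

# Upstream checkpoint id; the listed repo appears to mirror it.
model = INSTRUCTOR("hkunlp/instructor-xl")

# Each input is an [instruction, text] pair; the instruction steers the embedding.
embeddings = model.encode([
    ["Represent the science sentence for retrieval:",
     "Photosynthesis converts light energy into chemical energy."],
])
print(embeddings.shape)
```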
## Simcse Small E Czech
Author: Seznam · Tags: Text Embedding, Transformers, Czech · Downloads: 1,543 · Likes: 1

A Czech sentence-similarity model fine-tuned from Seznam/small-e-czech with the SimCSE objective.
## Gte Small
Author: Supabase · License: MIT · Tags: Text Embedding, Transformers, English · Downloads: 481.27k · Likes: 89

GTE-small is a general text embedding model trained by the Alibaba DAMO Academy on a BERT backbone, suitable for information retrieval and semantic textual similarity.
## Gte Small
Author: thenlper · License: MIT · Tags: Text Embedding, English · Downloads: 450.86k · Likes: 158

A compact general-purpose text embedding model for a range of NLP tasks, including sentence-similarity computation, text classification, and retrieval.
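
Embedding models like GTE drop straight into a semantic-search loop: embed the corpus once, embed each query, and rank by cosine similarity. A minimal sketch with sentence-transformers:

```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("thenlper/gte-small")

docs = [
    "Semantic search matches by meaning rather than exact keywords.",
    "The recipe calls for two cups of flour and one egg.",
]
doc_emb = model.encode(docs, convert_to_tensor=True, normalize_embeddings=True)
query_emb = model.encode("how does semantic search work",
                         convert_to_tensor=True, normalize_embeddings=True)

# Rank the corpus by cosine similarity to the query.
for hit in util.semantic_search(query_emb, doc_emb, top_k=2)[0]:
    print(f"{hit['score']:.3f}  {docs[hit['corpus_id']]}")
```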
## Gte Base
Author: thenlper · License: MIT · Tags: Text Embedding, English · Downloads: 317.05k · Likes: 117

A general-purpose text embedding model focused on sentence similarity and text retrieval, performing well across multiple benchmarks.
## Instructor Large Safetensors
Author: gentlebowl · License: Apache-2.0 · Tags: Text Embedding, Transformers, English · Downloads: 16 · Likes: 0

INSTRUCTOR is a T5-based text embedding model focused on sentence-similarity computation and information retrieval; it performs well across NLP tasks including text classification, clustering, and semantic similarity evaluation.
## Bart Ranker
Author: bsl · License: MIT · Tags: Text Embedding, Transformers · Downloads: 31 · Likes: 3

Predicts the relevance of query-document pairs; suitable for information retrieval tasks.
## Doc2query T5 Base Msmarco
Author: macavaney · Tags: Text Embedding, Transformers, English · Downloads: 341 · Likes: 2

A document-expansion model based on T5-base and trained on MS MARCO; it generates queries a document could plausibly answer, which are appended to the document before indexing to improve retrieval effectiveness.
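
Document expansion with doc2query is a plain sequence-to-sequence generation step: feed the passage in, sample a few predicted queries out. A sketch using the castorini checkpoint of the same model:

```python
from transformers import T5ForConditionalGeneration, T5Tokenizer

tok = T5Tokenizer.from_pretrained("castorini/doc2query-t5-base-msmarco")
model = T5ForConditionalGeneration.from_pretrained("castorini/doc2query-t5-base-msmarco")

doc = ("The Manhattan Project was a research effort during World War II "
       "that produced the first nuclear weapons.")
inputs = tok(doc, return_tensors="pt")

# Sample several plausible queries; appending them to the document before
# indexing is what boosts term-matching retrieval.
outputs = model.generate(**inputs, max_length=64, do_sample=True,
                         top_k=10, num_return_sequences=3)
for o in outputs:
    print(tok.decode(o, skip_special_tokens=True))
```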
## Mdpr Tied Pft Msmarco Ft All
Author: castorini · Tags: Large Language Model, Transformers · Downloads: 386 · Likes: 0

A dense retrieval model: the castorini/mdpr-tied-pft-msmarco checkpoint further fine-tuned on all Mr. TyDi training data.
## Dense Encoder Distilbert Frozen Emb
Author: vocab-transformers · Tags: Text Embedding, Transformers · Downloads: 26 · Likes: 0

A DistilBERT-based dense retrieval model trained on MS MARCO with the word-embedding layer frozen.
## Doc2query T5 Base Msmarco
Author: castorini · Tags: Large Language Model · Downloads: 1,064 · Likes: 14

A document-expansion model that converts documents into likely queries to improve document search relevance.
## Duot5 Base Msmarco
Author: castorini · Tags: Large Language Model · Downloads: 4,915 · Likes: 0

A T5-base re-ranking model fine-tuned on the MS MARCO passage dataset; it compares candidate passages pairwise to improve the relevance ranking of retrieval results.
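
Where monoT5 scores one passage at a time, duoT5 asks which of two candidates is more relevant and aggregates the pairwise wins into a final order. A small sketch of the prompt construction, with the template taken from the Expando-Mono-Duo design-pattern paper; treat the exact wording as an assumption and check the model card:

```python
from itertools import permutations

def duo_prompts(query: str, candidates: list[str]) -> list[str]:
    """Build duoT5-style pairwise prompts: the model is asked whether
    Document0 is more relevant than Document1 (template assumed from the
    Expando-Mono-Duo paper)."""
    return [
        f"Query: {query} Document0: {a} Document1: {b} Relevant:"
        for a, b in permutations(candidates, 2)
    ]

for p in duo_prompts("what causes tides", ["doc A text", "doc B text"]):
    print(p)
```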
## Msmarco Distilbert Word2vec256k MLM 230k
Author: vocab-transformers · Tags: Large Language Model, Transformers · Downloads: 16 · Likes: 0

A DistilBERT-based language model whose 256k-entry vocabulary is initialized with word2vec, trained on the MS MARCO corpus with masked language modeling (MLM).
## Monot5 3b Msmarco
Author: castorini · Tags: Large Language Model, Transformers · Downloads: 737 · Likes: 0

A T5-3B re-ranker fine-tuned for 100,000 steps on the MS MARCO passage dataset for document ranking.
## Monot5 Base Msmarco
Author: castorini · Tags: Large Language Model · Downloads: 7,405 · Likes: 11

A T5-base re-ranking model fine-tuned for 100,000 steps on the MS MARCO passage dataset, suitable for document re-ranking in information retrieval.
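
monoT5 casts relevance as one-token generation: the input is "Query: … Document: … Relevant:" and the score is the probability of the model emitting "true" rather than "false" at the first decoded position. A sketch of that scoring step, following the standard monoT5 format from the reference implementation:

```python
import torch
from transformers import T5ForConditionalGeneration, T5Tokenizer

tok = T5Tokenizer.from_pretrained("castorini/monot5-base-msmarco")
model = T5ForConditionalGeneration.from_pretrained("castorini/monot5-base-msmarco")

query = "what causes tides"
doc = "Tides are caused by the gravitational pull of the moon and the sun."
inputs = tok(f"Query: {query} Document: {doc} Relevant:", return_tensors="pt")

# First token ids of "true" / "false" (tok.encode appends EOS, hence [0]).
true_id, false_id = tok.encode("true")[0], tok.encode("false")[0]

with torch.no_grad():
    start = torch.tensor([[model.config.decoder_start_token_id]])
    logits = model(**inputs, decoder_input_ids=start).logits

# Relevance score = P("true") among {"true", "false"} at the first position.
probs = torch.softmax(logits[0, 0, [true_id, false_id]], dim=-1)
print(f"relevance = {probs[0].item():.3f}")
```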
## S PubMedBert MS MARCO
Author: pritamdeka · Tags: Text Embedding, Transformers · Downloads: 30.50k · Likes: 28

A sentence-transformers model: PubMedBERT fine-tuned on the MS MARCO dataset, suited to semantic similarity computation and information retrieval over medical and health text.
## Bert Fa Base Uncased Wikinli Mean Tokens
Author: m3hrdadfi · License: Apache-2.0 · Tags: Text Embedding, Persian · Downloads: 555 · Likes: 0

A Persian sentence-embedding model based on ParsBERT for generating high-quality sentence vector representations.
## Bert Base Msmarco
Author: Capreolus · Tags: Large Language Model · Downloads: 64 · Likes: 0

BERT-Base fine-tuned on the MS MARCO passage classification task, suitable for document re-ranking.